-
Notifications
You must be signed in to change notification settings - Fork 147
feat: add TORCHX_IMAGE to env vars for Docker-based schedulers (#1128) #1129
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
For your consideration, @kiukchung @andywag @d4l3k |
|
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #1129 +/- ##
=======================================
Coverage 91.63% 91.63%
=======================================
Files 83 83
Lines 6392 6397 +5
=======================================
+ Hits 5857 5862 +5
Misses 535 535
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
The lint failure seems unrelated, @kiukchung |
this is unrelated to your changes I'll override it. We're experiencing a few broken assets due to the migration of this repo from github.com/pytorch/torchx to github.com/meta-pytorch/torchx fix for doctest is: #1131 but it seems that our AWS creds are broken at the moment. |
|
Sounds good, @kiukchung! Looks like |
|
@kiukchung has imported this pull request. If you are a Meta employee, you can view this in D83282650. |
Adding
TORCHX_IMAGEto environment variables so that we give a chance to the payload to record the value beyond the retention restrictions of the scheduler so as to facilitate reproducibility.Test plan:
[x] updated unit tests
[x]
local_docker --workspace ""-> no image built or pushed -> we expect to see the base image inTORCHX_IMAGEenv varstorchx run -s local_docker --workspace "" utils.sh --image alpine:latest -- env[x]
local_docker-> image built locally, but not pushed -> we expect to see local image SHA inTORCHX_IMAGEenv varstorchx run -s local_docker utils.sh --image alpine:latest -- env[x]
aws_batch --image_repo-> image built locally, tagged and pushed to image repo -> we expect to see image tag inTORCHX_IMAGEenv varstorchx run -s aws_batch --scheduler_args 'queue=<queue-name>,priority=<priority>,image_repo=<image-repo>,share_id=<share-id>' utils.sh -h g5.4xlarge --image alpine:latestObserve in Batch Job details: